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COPYRIGHT NOTICE 

A portion of the disclosure of this patent contains material which is subject to 
copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure, as it appears 
15 in the U.S. Patent and Trademark Office patent files or records, but otherwise reserves 
all copyright rights whatsoever. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to electronic systems and more particularly to 
20 scheduling and allocation of resources within such a system. 

Description of the Related Art 

Systems having shared resources are common. One example of such a system 
is a computer network in which computing devices such as workstations, personal 
computers, servers, storage devices, firewalls and other computing devices function as 
25 nodes of a network with at least one network element connecting the computing 
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devices. The various nodes transmit and/or receive various kinds of information over 
the network, typically in packets. 

The rules by which nodes transmit and receive packets are defined in various 
protocols. A network protocol provides rules to route a packet of information from a 
source to a destination. A packet is generally a portion of a message transmitted over 
a network that typically includes routing or destination information in addition to data 
information. Packets may vary in size from only a few bytes to many thousands of 



In a switched network, congestion arises when a shared resource such as a 
path, internal or external to the switch, is requested to forward more packets than its 
capacity allows. In switched networks (and other systems) an arbiter may be used to 
schedule usage of the shared resources to prevent conflicts resulting from requests for 
simultaneous access to the same shared resource. One example of such a system 
having shared resources susceptible to conflicts is a network switch having multiple 
input ports and multiple output ports in which input ports make requests for 
connection to the output ports. Each requester or input port sends a request for an 
output port (or sends a set of requests for multiple output ports) to an arbiter. 

A conflict arises when a particular output port is requested by multiple input 
ports at the same time (assuming an output port can be allocated to only one input port 
at a time). To deal with the conflict, an arbitration decision awards the output port to 
one of the requesters. The arbiter chooses the winner such that resources (the output 
ports) are allocated to requesters in a conflict-free way. Thus, certain requests are 
serviced by being granted an output port and certain requests may be left unserviced 
in that they are denied an output port. However, the choice of which requests to grant 
may lead to under-utilization of the resources since some requests may be denied. 

Another example of a system having shared resources is a computer system in 
which multiple processors are coupled to multiple memories. Assume that each 
processor has access to all of the memories and each memory can only be accessed by 
one processor at a time. When multiple processors request access to the same 
memory at the same time, an arbitration decision has to be made as to which 
processor gets to access the memory in question. 
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One type of traffic that can be found on systems described above is 
isochronous traffic. Isochronous traffic has time constraints requiring the isochronous 
data to be dehvered within a certain time. For example, a video stream needs to be 
deUvered in time to be displayed on a screen without delay negatively affecting video 
5 image quality. Unfortunately, many existing systems are optimized for asynchronous 
traffic rather than for the requirements of isochronous traffic. Because of the time 
constraints, isochronous transfers add increased scheduling complexity in a system in 
which an arbiter allocates resources. 

Multicast transfers, which use one to many communication in which a single 
10 source communicates with multiple targets simultaneously, provide efficiency since 
such transfers save network bandwidth and reduce the time to execute the multicast. 
However, such transfers can also add increased scheduling complexity, particularly in 
a system that mainly supports unicast operations. That increased complexity arises in 
part because multicast requires that multiple resources be available simultaneously. 

15 Accordingly, it would be desirable to provide a way to more easily deal with 

scheduling complexities associated with isochronous traffic and/or multicast traffic in 
a system requiring arbitration for resoiu-ces. 

SUMMARY OF THE INVENTION 

Accordingly, in a first aspect of the invention, a method for allocating a 
20 plurality of resources in a communication network includes reserving a first portion of 
the resources for a particular time period on the network during a first arbitration 
phase, in response to prescheduling requests. During a second arbitration phase a 
second portion of the resources are allocated for the particular time period in response 
to regular requests. Data is transferred across the communication network according 
25 to the allocating of resources. 

In accordance with another aspect of the invention a network system is 
provided that includes a data transport medium attached to a plurality of sources and a 
plurality of targets. The system includes an arbiter coupled to receive first requests 
for transfers from one or more of the sources to one or more of the targets during a 
30 time slot on the data transport medium. The arbiter is fiirther coupled to receive 
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regular requests from the sources for transfers from one or more of the sources to one 
or more of the targets during the time slot. The arbiter allocates the targets to the 
sources in accordance with the first requests and then in accordance with the regular 
requests. The first requests may come in the form of a precalculated schedule. 

In accordance with still another aspect of the invention a method is provided 
for allocating a plurality of resources in an electronic system. The method includes 
allocating a first group of one or more of the resources in accordance with first 
requests for the resources, the first group being allocated for a particular time period; 
and subsequently allocating a second group of one or more of the resources for the 
particular time period, the first and second group of resources being mutually 
exclusive. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention may be better understood, and its numerous objects, 
features, and advantages made apparent to those skilled in the art by referencing the 
accompanying drawings in which the use of the same reference symbols in different 
drawings indicates similar or identical items. 

Fig. 1 shows a network system including a network switch that can 
advantageously be scheduled according to one or more embodiments of the present 
invention. 

Fig. 2 illustrates asynchronous or bursty traffic typically found on a network. 

Fig. 3 illustrates periodically scheduled isochronous traffic. 

Fig. 4 illustrates requests for prescheduled slots for isochronous or other 
periodic data. 

Fig. 5 illustrates a conflict free schedule generated in response to the requests 
shown in Fig. 4. 

Fig. 6 illustrates requests for prescheduled slots for multicast data. 
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Fig. 7 illustrates a conflict free schedule generated in response to the requests 
shown in Fig. 6. f 

Fig. 8 illustrates conflicting prescheduled requests in a system having four 
requesters and four resources. 

5 Fig. 9 illustrates operation of an exemplary arbitration scheme to allocate the 

resources in a conflict free and fair manner. 

Fig. 1 0 illustrates operation of an arbiter allocating prescheduled requests. 

Fig. 1 1 illustrates allocating regular requests following allocation of the 
prescheduled requests in Fig. 10. 

10 Fig. 12 illustrates operation of an arbiter allocating prescheduled requests in 

which one of the resources is inactive. 

Fig. 13 illustrates allocating regular requests following allocation of the 
prescheduled requests shown in Fig. 12. 

Fig. 14 illustrates a vector being sent to a central arbiter containing both 
1 5 prescheduled and regular requests. 

Fig. 15 illustrates a high level block diagram of an embodiment of a hardware 
arbiter for arbitrating regular requests. 

Fig. 16 illustrates a high level block diagram of an embodiment of a hardware 
arbiter for arbitrating prescheduled requests. 

20 Fig, 17 illustrates a computer system having multiple processors contending 

for multiple memory resources, which can advantageously exploit one or more 
embodiments of the invention to schedule the memory resources. 

DESCRIPTION OF THE PREFERRED EMBODIMENTrS^ 

Referring to Fig, 1, a network 100 is illustrated that has resources that can 
25 advantageously be scheduled according to one or more embodiments of the present 

invention. More particularly, arbiter 101 schedules usage of shared resources, i.e., the 
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data paths connecting input ports 103, 105 and 107 to output ports 109, 111 and 113 
in switch 115. Each input node 110, 112, 1 14 is connected to a corresponding input 
port. Since multiple requesters, i.e., multiple input nodes, can simultaneously request 
a shared resource, i.e., a connection to an output port, arbiter 101 allocates data paths 
through the network. The arbiter tries to connect input and output ports in such a way 
that as many packets as possible can be forwarded simultaneously, thus maximizing 
usage of shared resources. Each requester sends a set of requests to arbiter 101, 
which then chooses the requests to be granted such that resources are allocated to 
requesters in a conflict-free way. When input and output nodes communicate with 
arbiter 101, they are referred to as initiator and target, respectively. In a typical 
system, a node combines both an input node and an output node. 

In one embodiment of the invention, arbiter 101 simultaneously receives 
request signals 117 for shared resources from the various input nodes 110, 112 and 
114. Scheduling happens synchronously in that grant signals 119 are sent 
simultaneously and the usage interval for each resource has the same length. 
Scheduling may be fiirther constrained in that only one requester can use a particular 
resource at the same time. The main goal of an arbitration scheme typically is to 
achieve high aggregate usage of the resources while still providing a minimum level 
of faimess, so that the arbitration scheme avoids starvation of individual requests. 

There are many different arbitration schemes that arbiter 101 can implement to 
allocate data paths through the switch. For example, arbiter 101 can allocate the data 
paths by prioritizing requests from nodes based on the number of their requests. In 
such a scheme, highest priority is given to the requester with the fewest number of 
outstanding requests, and the lowest priority is given to the requester with the highest 
number of outstanding requests. Resources are scheduled one after the other in that a 
resource is allocated to the requester with the highest priority first. That way requests 
from the requester having the fewest choices are chosen first to be allocated. Thus, in 
such an embodiment, priority is inversely related to the number of requests being 
made. Requesters with many requests have more choices than requesters with few 
requests and, therefore, can be considered later and still have a reasonable likelihood 
of being granted one of their outstanding requests. That strategy increases the number 
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of granted requests and results in higher aggregate usage of system resources when 
compared with other arbitration schemes. 

In another embodiment, the arbitration scheme may allocate output ports 
based on the number of requests being made for a particular output port. Those 
output ports with the fewest requests are allocated first. A round robin scheme can 
also be used by the arbiter to avoid starvation in conjunction with those embodiments. 
Further details on an arbiter which may be used in some or all of the embodiments 
described herein, can be found in the patent application entitled "Least Choice First 
Arbiter", previously mentioned. Of course, one of ordinary skill would understand 
that many other arbitration schemes are known in the art and may be utilized in the 
various embodiments described herein. 

Network 100 typically carries several different kinds of data traffic. Referring 
to Fig. 2, one kind of traffic that is commonly carried in such data networks is 
asynchronous traffic. Asynchronous traffic tends to occur in bursts. Thus, as shown 
in Fig. 2, packet Po is followed by a burst of packets P1-P4 which are then followed 
after an interval by P5. Usage of network resources by such traffic is usually 
unpredictable. Many existing systems are optimized for such bursty traffic. 

Another kind of traffic network 100 typically carries is isochronous traffic, 
which may require that data packets be transferred at a fixed rate with as little jitter as 
possible. Referring to Fig.' 3, an example of such isochronous traffic is illustrated. 
Examples of such traffic include audio data, video data and animated graphics. Each 
isochronous packet needs to be received within time period T\. If not, jitter results, as 
is the case for packet P3, possibly resulting in degradation of video or audio quality. 

As previously mentioned, scheduling isochronous and/or multicast data can be 
burdensome for an arbiter optimized for unicast and/or asynchronous, non-periodic 
types of data transfers. In order to better deal with certain categories of data transfers 
such as data transfers involving isochronous data or multicast data according to an 
embodiment of the present invention, a precalculated schedule is established before 
the arbiter 101 arbitrates the "regular" requests, e.g., for asynchronous requests. The 
precalculated schedule may be calculated by one of the nodes in the form of a 
centralized scheduler or by all the nodes in the form of a distributed scheduler. In a 
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distributed scheduler, scheduling can be distributed by having each output node 
manage the output port it is connected to. 

The precalculated schedule may be used to schedule isochronous data to 
implement quality of service (QoS), e.g., transmission of audio or video streams 
5 across a network. The source of the stream, e.g., an input node in network 100, asks 
the scheduler to periodically reserve a switch slot. Assume that switch 115 is a 
synchronous switch that changes settings of the switching fabric periodically (the 
period being called a slot). A precalculated schedule can accommodate isochronous 
traffic by allocating the necessary connection between an input and output port at 

10 intervals derived from the rate of the isochronous data stream. That way, an 

appropriate amount of switch bandwidth can be reserved. For example, if B is the 
port or link bandwidth and b the bandwidth needed to transfer the isochronous data 
stream, the precalculated schedule has to reserve one slot every B/b slots. For 
example, if the link bandwidth is 2.5 Gbits/s and the stream requires a bandwidth of 

15 2.5 Mbits/s, the source of the stream asks the scheduler to reserve 1 out of every 1000 
slots. The slots can be reserved far in advance, possibly for the whole duration of the 
transfer and before the transfer is started. 

In an embodiment, the precalculated schedule may be communicated to the 
arbiter 101 with the help of request packets. For every slot on the switch, the arbiter 

20 receives one request packet containing regular requests from every node. That 

request packet may contain an additional vector of prescheduled targets as described 
further herein. The arbiter uses that information in that the arbiter does not allocate to 
regular requests output ports that are reserved by the precalculated schedule. That can 
be accomplished by the arbiter allocating the precalculated schedule as received or 

25 preferentially scheduling those requests that are indicated to be prescheduled requests. 

Another type of traffic that may be carried over network 100 (Fig. 1) is 
multicast data which uses multiple communications paths between one input port and 
multiple output ports. Thus, referring to Fig. 1, input port 103 may be coupled to send 
data to multiple ones of the output ports 109, 1 1 1 or 1 13. While multicast data can 
30 always be sent sequentially, it is more efficient to transfer multicast data in parallel 
since it saves network bandwidth and reduces the time needed to execute the 
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multicast. However, multicast can be difficult to implement in a system that more 
generally supports unicast operations. One difficulty is resource allocation since 
multicast requires multiple resources to be available simultaneously. Another 
complication arises with request-acknowledge protocols needed for reliable 
communication. More specifically, to determine that a multicast packet was 
successfiilly delivered, the source of the multicast needs to collect an acknowledge 
fi-om every destination that was addressed by the multicast packet. One solution to 
that problem is described in U.S. Application No. 09/659,106 entitled, "Reliable 
Multicast Using Merged Acknowledgements", filed September 11, 2000, naming 
Hans Eberle and Nils Gura as inventors. 

An embodiment of the present invention uses a centralized scheduler 
implemented in software to generate the precalculated schedule, which is then sent to 
the arbiter. The scheduler runs on one of the nodes connected to the network. The 
scheduler collects requests fi*om the other nodes to reserve slots on the network. 
Based on the requests and the existing allocations, a precalculated schedule is 
generated. A simple scheduling policy may consider requests on a first-come, first- 
serve basis and grant a request if the corresponding slots are available on the network. 
Another scheduling policy tries to maintain some degree of faimess by allocating 
equal shares of the switch bandwidth to each host. 

After being generated in the centralized scheduler, the precalculated schedule 
is communicated to the nodes fi-om the centralized scheduler such that for every slot, 
the nodes can inform the arbiter about the connections which have been prescheduled. 
Although the precalculated schedule is required to be conflict-fi-ee in one 
embodiment, the arbiter may still check to ensure that an output port connects to only 
one input port, and if necessary, drop conflicting connections. In that way, any errors 
that are present in the prescheduled transfers can be corrected. If the system does not 
provide a conflict- fi-ee precalculated schedule, the conflicts in the schedule can be 
resolved by the central arbiter. Once the pre-allocated slots are determined, the 
remaining output ports can then be scheduled by the arbiter. 
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In one embodiment the software scheduler distributes the precalculated 
schedule to the nodes, which in turn send the corresponding "prescheduled requests" 
together with their "regular requests" to the centralized arbiter (see Fig. 14). 

A precalculated schedule can also be used to schedule multicast packets. To 
make that possible, the precalculated schedule specifies multipoint connections, in 
which multiple output ports are connected to the same input port. Implementing 
multicast with the help of a soflrware prescheduler rather than the hardware arbiter is 
attractive since multicast connections constrain scheduling and therefore can be more 
easily handled in software. 

Referring to Fig. 4 and Fig. 5, an example of prescheduling isochronous data 
transfers is shown. In Fig. 4 two initiators lo and Ii are requesting connection to 
targets T2 and T3 during various time slots on the network. More particularly, initiator 
lo is requesting a connection to target T2 during slot 1 . A slot is a time period on 
switch 115 during which a particular input port is connected to an output port. 
Additionally, the request by lo includes a rate 2 and a length 3. The rate and length 
numbers are indicative of the periodicity of the request, which is explained more fiiUy 
with relation to Fig. 5. Initiator Ii is requesting connection to target T3, i.e., output 
port 3, during slot 1 . The request includes a rate of 0 and a length of 1 . In addition, 
initiator Ii is requesting connection to target T2 during slot 5 with a rate of 4 and a 
length of 5. 

Fig. 5 shows the conflict- firee schedule that results after the requests in Fig. 4 
are sent to a software scheduler executing in one of the nodes coupled to switch 115. 
Fig. 5 shows the precalculated schedule for source lo with respect to targets To, Ti, T2 
and T3, and the precalculated schedule, in the next column, for source Ii with respect 
to targets To, Ti, T2 and T3. During slot 0, no requests for precalculated slots were 
made and no slots are shown as allocated. During slot 1, the "0010" value in the lo 
column corresponding to slot 1 , indicates that source Iq is scheduled to be connected 
to target T2. During slot 3 and slot 5, target T2 is also allocated to lo. That is because 
the request included a rate of 2, meaning it was to be repeated every two slots, and a 
length of 3, meaning that it was to be repeated for a total of 3 slots. Thus, the request 
shown in Fig. 4 by source lo requested connections to target T2 during slots 1, 3 and 5, 
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The "0001" in the Ii column corresponding to slot 1 indicates that source Ii is 
scheduled to be connected to target T3. The rate of 0 and length of 1 shown in Fig. 4 
indicates that the request is not a periodic request as was the request by Iq. Slot 2 has 
all 0 entries since neither of the sources requested targets during that period. Referring 
5 again to Fig. 4, the second request by initiator or source Ii for target T2 during slot 5 is 
shown as shaded because it was not scheduled due to a conflict with the periodic 
request by initiator Iq. Thus, either the conflicting request is dropped entirely or it is 
scheduled after slot 5. That determination can be made based on system 
implementation, and factors such as the type of traffic requesting the slot. 

10 Referring to Figs. 6 and 7, illustrated are prescheduled requests and a resulting 

schedule for pre- allocated multicast transfers from sources Iq, Ii and I2 to targets Ti, 
T2 and T3. In the example shown in Figs. 6 and 7, the schedule generated is conflict 
firee. Referring to Fig, 6, initiator Iq is requesting cormection to two targets, T2 and T3, 
dxuing slot 1. Initiator Ii is requesting connection to three targets, Ti, T2 and T3, 

1 5 during slot 2, and initiator I2 is requesting connection to targets Ti and T3 during slot 
1 . The request by initiator I2 is shaded to indicate that that request is not scheduled by 
the scheduler due to a conflict with the request by initiator Iq since both are requesting 
the path to target T3 during the same slot. In the example given, the scheduler gives 
priority to the request by Iq. The request by initiator I2 may be delayed rather than 

20 canceled. 

Referring to Fig, 7, an exemplary precalculated schedule is illustrated that is 
generated in response to the requests shown in Fig. 6. During slot 0, since there were 
no requests, no transfers are scheduled. During slot 1, initiator Iq is shown in the lo 
column as being scheduled for connection to targets T2 and T3 as indicated by the 
25 "001 1". All other entries for slot 1 are 0. For slot 2, the precalculated schedule shows 
that initiator Ii is scheduled to be connected to targets Ti, T2 and T3, as requested. 

The schedules shown in Fig. 5 and Fig. 7 are non-conflicting schedules, which 
are provided to an arbiter such as arbiter 101 (Fig. 1) after they are generated, so that 
the arbiter can know which of the resources have been pre-allocated. The arbiter then 
30 continues with arbitration of the "regular" requests, i.e., the non-pre-allocated 

requests, to allocate the remaining resources. Since those resources that are pre- 
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allocated are not available, they are not considered; the pre-allocated requests are 
essentially given priority by the arbiter. The "regular" requests in one typical network 
are requests for asynchronous transfers, but they are not limited to such transfers. 

In other embodiments, the scheduler may not generate a conflict-free schedule 
5 for the various pre-allocated requests. Fig. 8 shows the requests by requestors lo, Ii, h 
and I3 for connection to targets To, Ti, T2 and T3 during slots 0, 1 and 2. The targets 
are represented by binary digits in the columns as shown by the "01 11" in the Iq 
column in Fig, 8. In slot 0, the requestor Iq is requesting connection to targets Ti, T2 
and T3. In slot 2, requestor lo is also requesting connection to targets To and Ti. 

10 Requestor Ii is requesting connection during slot 0 to targets To and T3. Requestor Ii 
is also requesting connection during slot 1 to To and T3. Requestor Ii is also 
requesting connection to targets T2 and T3 during slot 2. Requestor I2 is requesting 
connection during slot 1, to targets To, Ti and T2. Because certain of those requests 
conflict, an arbitration scheme must be utilized to determine which of the requests are 

15 going to be granted. 

Referring to Fig. 9, operation of an exemplary arbitration scheme is illustrated. 
The initiators I and targets T are shown in an array in which the requests for a target 
are indicated by 1 in the array location corresponding to the request. Thus, for slot 0, 
location [0,1] is shown having a 1 to indicate that initiator lo is requesting target Ti 

20 during slot 0. In order to determine which of the initiator's requests are considered 

first, a round robin approach may be utilized. In the exemplary embodiment shown, a 
round robin scheme is utilized in which the requests of the initiator designated by the 
round robin row are considered first. In Fig. 9, the current round robin row for 
arbitrating the requests for slot 0 is assumed to be initiator lo. Because lo has requests 

25 for targets Ti, T2 and T3, those requests are granted as indicated by the X in those 

positions. In contrast, the requests by initiator Ii for targets To and T3 do not have Xs, 
indicating that those requests were not granted. 

In arbitrating the requests for slot 1 , the round robin row has moved to initiator 
Ii and thus any requests by Ii are considered first. Because I\ has requests for To and 
30 . T3 in slot 1, those requests are granted as indicated by the X at positions [1,0] and 



[1,3]. 
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The requests for slot 2 are arbitrated as follows: the round robin row for slot 2 
dictates that the requests by I2 be considered first. Since there are no requests by h 
for targets in slot 2, the requests subsequently considered are by I3, lo and Ii, 
respectively. The requests by Iq for targets To and Ti and the requests by Ii for targets 
T2 and T3 are granted. 

While not shown in the examples in Figs. 6, 7, 8 and 9, multicast transfers can 
also be scheduled periodically as were exemplary isochronous transfers in Figs. 4-5. 
Thus, a multicast requestor may request, e.g., that it be connected to three targets 
every five slots for a duration of ten total slots. If such a request were granted, every 
fifth slot of the next 50 slots would be allocated to that periodic multicast request. 

Referring to Figs. 10 and 11, operation of a two phase arbitration scheme for 
multicast transfers according to one embodiment of the present invention is shovm. 
Fig. 1 0 illustrates generation of a pre-calculated schedule in a first phase, and Fig. 1 1 
shows scheduling of "regular requests" in a second phase. In Fig. 10, it can be seen 
that initiators Ii and I2 have requests for targets Ti and T2, and T2 and T3, respectively. 
As can be seen, the requests for T2 conflict. Accordingly, the arbiter has to allocate 
the resources to only one of the requests. Using a round robin scheme, in the first 
phase, the arbiter first checks the requests at the round robin location, in this example 
the roimd robin row is Ii. Because Ii has requests for resources Ti and T2, those 
requests are granted as indicated by the X at locations [1,1] and [1,2]. Since I2 has a 
request for target T2 that conflicts with a granted request of Ii, all of the requests oft 
are ignored. Thus, the pre-calculated schedule allocates targets Tj and T2 to requestor 
Ii for that particular slot. 

In the second phase, the central arbiter schedules the regular requests as shown 
in Fig. 11. The resources Ti and T2 granted to precalculated requests as shown in Fig. 
10 are shown in dashed lines in Fig. 1 1 indicating that they are unavailable for 
allocation. In the exemplary second phase illustrated, the arbiter starts evaluating 
requests at the round robin position [1,1] indicated by the circle at that position, which 
happens to be unavailable since Ti was allocated to a precalculated request. Next, the 
arbiter evaluates requests for T2. Again, T2 was allocated to a precalculated request 
and no fiirther scheduling decision needs to be made. The arbiter now moves on to 
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scheduling T3. Both Iq and I2 are requesting T3. Since I2 has fewer requests than Iq, I2 
is given priority and the request in position [2,3] is granted as indicated by the X. 
Note that in this embodiment all regular requests shown in Fig. 11 are considered 
when calculating the priorities of the initiators. An altemative embodiment only 
5 considers regular requests for resources that have not been granted either to a 

prescheduled request or to a regular request. In an embodiment (not shown in Figs. 
10 and 1 1), to avoid interference with the faimess guarantees provided when 
scheduling "regular" requests, the precalculated scheduler does not consider a request 
in the column with the round robin position of Fig. 1 1. In that embodiment, 
10 broadcasts including all targets are not possible. In yet another embodiment, the 

round robin position of Fig. 11 is not moved if the target that the round robin position 
points to was granted to a prescheduled request. 

Referring to Fig. 12, the first phase of another two phase arbitration 
embodiment is illustrated in which the targets may be inactive and thus not allocable. 
For example, targets are declared inactive by the arbiter 101 if the arbiter does not 
receive a configuration packet from the node indicating the node is active. Nodes are 
expected to send a configuration packet every arbitration cycle to indicate which 
targets they are requesting. In Fig. 12, the target Ti is inactive and unavailable. Thus, 
the request by Ii for target Ti shown in Fig. 12 at [1,1] is not granted because target Ti 
is inactive. In a manner similar as that described with relation to Fig. 10, the round 
robin row is at 1 1 and thus, the requests by initiator Ii are considered first, resulting in 
the request at [1,2] being granted as indicated by the X at that position. Thus, a pre- 
calculated schedule is determined that grants target T2 to requestor Ii. 

Referring to Fig. 13, the operation of the arbiter to schedule the regular 
25 requests is illustrated. Target Ti is illustrated as an inactive resource. Target T2 is 
marked as a prescheduled resource, and thus unavailable since that target was 
allocated to Ii in the pre-allocated schedule. The arbiter begins at the round robin 
position [2,0] indicated by the circle at that position. As can be seen by the 0 at that 
position there is no request. The arbiter then proceeds to determine the remaining 
30 requests for that target. The requests that are active for that target are the requests at 
[0,0] and [3,0]. Since lo has fewer requests than I3, it is given priority and To is 
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allocated to Iq. T3 is the last resource to be scheduled. There is only one request at 
[3,0] to be considered. It is granted and T3 is allocated to I3. 

The requests for prescheduled slots can be provided to the centralized 
scheduler residing in one of the hosts through packets including the information such 
as that shown in Figs. 4 or 6. Referring to Fig. 14, an exemplary set of vectors 
illustrating regular and prescheduled requests provided directly to the central arbiter 
such as 101 is shown. The requests correspond to the prescheduled and regular 
requests made in Figs. 12 and 13. 

While some embodiments of the invention may utilize a hardware centralized 
arbiter for scheduling regular requests and a software scheduler for scheduling 
prescheduled requests, other embodiments may employ entirely hardware or entirely 
software arbiters, or a mixture of the two according to system requirements. 
Operation of an exemplary centralized arbiter is illustrated in the software listing in 
Appendix A. Array P indicates whether a requester (also referred to herein as a 
source or initiator) has a prescheduled connection with a resource (also referred to 
herein as a target). Procedure arbitrate determines if the prescheduled requests have 
any conflicts and provides a conflict fi-ee allocation of resources. Array P may be 
generated by a scheduler in one of the nodes and may already be conflict free. The 
procedure then goes on to allocate "regular requests" that are contained in array R. If 
a resource is inactive, the corresponding requests in P and R are ignored and the 
resource is not granted. 

Appendix B illustrates an exemplary embodiment of a software prescheduler. 
In the software prescheduler, fimction conflict checks whether a request conflicts with 
the transfers that are already scheduled. If there is no conflict, the requested transfer 
is added to the list of scheduled transfers and the fimction returns true, otherwise, if 
there is a conflict, the fimction returns false. Function schedule continues to call 
fimction conflict by increasing the start frame of the request until the request can be 
scheduled or the deadline is exceeded. Conflicts are checked as follows. Given a 
request specified by slot s, rate r and length 1, as described in Appendix B, and a 
scheduled transfer specified by parameters t^.s, t'^'.r, t^l, as described in Appendix B, a 
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a conflict arises if equation (s + m*r = t^.s + n*t'^.r) holds for integer numbers 
m = 0 . . 1-1 and n - 0 . . t'^.l-l (where * indicates multipUcation). 

Appendix B illustrates only one example of many possible approaches to the 
prescheduling problem described herein. Algorithms to address this general type of 
scheduling problem are know as "earliest deadline first" or "static priority" algorithms 
and are well known in the art. Such algorithms can be utilized for the software 
prescheduler in embodiments of the invention. 

An exemplary hardware centralized arbiter in shown in Fig. 15. Assume there 
are MaxReq inputs to the arbiter (i.e. MaxReq is the maximum number of requesters). 
Fig. 15 illustrates hardware 1500 associated with the first requester (input 0), which 
corresponds to requester 0 and hardware 1502 associated with the last requester 
(MaxReq- 1). The embodiment illustrated in Fig. 15 operates as follows. Each of the 
elements of R[i,*], which includes elements 1503 and 1504, is initialized, where * 
represents the number of resources. Thus, for example, element 1503 contains all 
requests input by the associated requester 0 for the various resources. In addition, the 
array S, which includes elements 1509 and 1511 of the illustrated hardware, is reset. 
The requests for the illustrated elements 1503 and 1504 are summed in summers 
1505, 1506 to generate a number of requests (NRQs) 1507 and 1508 for the respective 
requesters. If R[I-i-res, J+res] = 1, then grant (GNT) 1510 is set to (I+res) through 
multiplexer 1512. That is, the round robin position is granted. Otherwise, grant 
(GNT)1510 is set to the input with the minimum number of requests, which is 
determined in minimize circuit 1514. If there is no request for the current output, 
grant is set to -1 (note that the logic to set grant to -1 is assumed to be contained 
within grant (GNT) 1510). If grant (GNT) 1510 is not -1, then the location S[gnt] 
(e.g. 1509) is set to J+res, which indicates the resource granted to that requester. Each 
of the elements of R[gnt, *] is set to 0, where * represents the resources. The register 
1516 holding "res" is incremented so res = res + 1. The hardware continues the 
calculations for S until res = MaxRes, that is, the last resource is (J+MaxRes-1) mod 
MaxRes. The register 1518 containing the index I is incremented so I = I+l. If I = 
MaxReq, then I is set to 0 and the register 1520 containing J is incremented. The 
embodiment illustrated in Fig. 15 is an illustrative block diagram and does not show 
all the details described. As would be known to those of skill in the art, those details 
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along with many other hardware implementations can be provided to implement the 
various embodiments described herein. 

Fig. 16 illustrates an exemplary hardware arbiter for processing prescheduled 
requests. Registers 1601 (PR[0,*]) store prescheduled requests of requester 0, which 
may include requests for up to MaxRes resources as indicated by *. The requests 
from the intermediate requesters are not shown. The requests from the final requester 
(MAXREQ-1) are stored in registers 1603. Counters 1605 and 1607 indicate which 
request REQ is being considered along with the offset I that determines the round 
robin row. The ACT registers 1609 and 1611 provide an indication as to whether a 
particular resource is active. Based on the requester being considered provided by 
count registers 1605 and 1607, a particular request vector is selected by multiplexer 
1613 and provided to AND gates 1615 which AND together an indication of a request 
for a resource and whether that resovuce is active. The outputs of AND gates 1615 
are provided to schedule registers 1617, which store an indication of those resources 
that have been scheduled. If a resource has already been scheduled, the logic 1619 
prevents a subsequent request for an active resource from being set in schedule 
register 1617. Note that in the embodiment shown, if one request of a requester is 
already scheduled, then all of the requests from that requester are ignored. Otherwise, 
the schedule register 1617 is updated by the requests of the current requester. Note 
that register 1617 contains sticky bits, that is, once a bit is set, it remains set until the 
end of the arbitration cycle. 

If the requests by the requester are set in schedule registers 1617, the GNT 
registers 1621 and 1623 are appropriately set according to comparators 1625 and 
1627. AND gates 1629 and 1630 provide the appropriate gating function to set or not 
25 set the grant registers. The embodiment illustrated in Fig. 16 is an illustrative block 
diagram and does not show all the details of the circuit. As would be known to those 
of skill in the art, those details along with many other hardware implementations can 
be provided to implement the various embodiments described herein. 

Referring to Fig. 17, another embodiment of the invention is illustrated in 
30 which processors 1701, 1703, 1705 contend for access to memory resources 1707, 
1709, 1711 through the buses 1700 via arbiter 1713. In certain applications, e.g., 
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Streaming video, one of the processors may wish to preschedule memory accesses to 
ensure the streaming video can be displayed satisfactorily. Thus, one of the processor 
nodes may function as a prescheduler. That node receives requests from the various 
nodes and generates a schedule of preallocated requests, which is supplied to the 
5 arbiter 1713. Arbiter 1713 also receives regular requests from the processor nodes for 
memory access and allocates memory accesses according to the prescheduled requests 
and the regular requests. 

Thus, an approach has been described that facilitates scheduling of resources 
by an arbiter by providing prescheduled requests, along with regular requests. The 
10 use of prescheduling can be particularly usefiil where arbitration is complicated by, 
e.g., the periodic nature of the requests or by multicast operations. 

The description of the invention set forth herein is illustrative, and is not 
intended to limit the scope of the invention as set forth in the following claims in 
which variations and modifications of the embodiments disclosed herein, may be 
1 5 made based on the description set forth herein, without departing from the scope and 
spirit of the invention as set forth in the following claims. 
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A ppendix A 



1 


var 




2 


1: 0..MaxRq-1; 


(* round-robin requester offset *) 


3 


J: O..MaxRs-1; 


(* round-robin resource offset *) 


4 


IP: O..MaxRq-1 ; 


(* round-robin requester offset for precalculated 






schedule *) 


5 


rq: 0..MaxRq-1; 


(* requester *) 


6 


rs, r: 0..MaxRs-1; 


(* resource *) 


7 


gnt:-1..MaxRq-l; 


(* granted request *) 


8 


min: O..MaxRs+1; 


(* minimum number of requests *) 


9 


P: array [O..MaxRq-1, O..MaxRs-1] of boolean; 


(* P[l,j] is true If requester i has a prescheduled 






connection with resource j *) 


10 


R: array [O..MaxRq-1 , 0..MaxRs-1 ] of boolean; 


(* R[i,j] Is true If requester i Is requesting resource j *) 


11 


Nrq: array [0..MaxRq-1] of O..MaxRs; 


(* NRqfl] is the number of resources requested by 


requester i *) 




12 


Gnt: array [O..MaxRq-1 , O..MaxRs-1] of boolean; 


(* Gntfi.il Is true if requester 1 was granted resesource j *) 


13 


Mux: array [O..MaxRs-1] of -L.MaxRq; 


(* Mux[i] contains the mux select for resource i *) 


14 


Act: array [O..MaxRs-1] of boolean; 


(* Act[i] is tme if resource 1 Is active *) 


15 


ovi: boolean; 


(* ovl Is true if current request overlaps with previously 






scheduled ones *) 


16 


procedure arbitrate; 




17 


begin 




18 


for rq := 0 to MaxRq-1 do 




19 


for rs := 0 to MaxRs-1 do 




20 


Gnt[rq,rs] := false; 


(* Initialize grants *) 


21 


for rs := 0 to MaxRs-1 do 


(* Initialize mux selects *) 


22 


Mux[rs] := -1; 




23 


for rq := 0 to MaxRq-1 do 


(* check precalculated schedule *) 


24 


begin 




25 


ovI := false; 




26 


for rs := Oto MaxRs-1 do 


(* ignore requester if it has a preschuled connection 






with a resource *) 


27 


If P[(rq+IP) mod MaxRq.rs] and (Mux[rs] <> 


-1 ) then (* that has already been assigned *) 


28 


ovI := true; 




29 


if not ovl then 




30 


forrs:= Oto MaxRs-1 do 


(* assign requested resources *) 


31 


if P[(rq-+-IP) mod MaxRq,rs] and Act[rs] then 




32 


begin 




33 


Mux[rs] := (rq+IP) mod MaxRq; 




34 


Gnt[{rq-HlP) mod MaxRq.rs] := true; 




35 


for r := Oto MaxRs-1 do 




36 


R[(rq+IP) mod MaxRq.r] := false; 




37 


end; 




38 


end; 
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Appendix A (continued) 

39 forrq:=0toMaxRq-1 do 

40 begin 

41 Nrq[rq] := 0; 

42 for rs := 0 to MaxRs-1 do 

43 if R[rq,rs] and Act [rs] then Nrq[rq] := Nrq[rq] + 1; (* calculate number of requests for each 

requester *) 

44 end; 

45 for rs := 0 to MaxRs-1 do (* allocate resources one after the other *) 

46 if (Mux[(J+rs) mod MaxRs] = -1 ) and Act[(J+rs) mod MaxRs] then (* ignore requests for prescheduled 

or inactive resources *) 

47 begin 

48 gnt:=-1; 

49 if R[(l+rs) mod MaxRq,(J+rs) mod MaxRs] then (* round-robin position wins *) 

50 gnt := (l+rs) mod MaxRq 

51 else (* find requester with smallest 

number of requests *) 

52 begin 

53 min := MaxRs+1 ; 

54 for rq := 0 to MaxRq-1 do 

55 begin 

56 if (R[(rq+l+rs) mod MaxRq,(rs+J) mod MaxRs]) and (Nrq[(rq+l+rs) mod MaxRq] < min) then 

57 begin 

58 gnt := (rq+l+rs) mod MaxRq; 

59 min := Nrq [(rq+l+rs) mod MaxRq]; 

60 end; 

61 end; 

62 end; 

63 ifgnto-lthen 

64 begin 

65 Mux[(rs+J) mod MaxRs] := gnt; 

66 Gnt[gnt,(rs+J) mod MaxRs] tnje; 

67 for r := 0 to MaxRs-1 do R[gnt. r] := false; 

68 Nrq[gnt] := 0; 

69 for rq := 0 to MaxRq- 1 do 

70 if R[rq, (rs+J) mod MaxRs] then Nrq[rq] := Nrq[rq]-1 ; 

71 end; 

72 end; 

73 I := (1+1) mod MaxRq; 

74 if I = 0 then J := (J+1 ) mod MaxRs; 

75 IP :=(IP+1) mod MaxRq; 

76 end; 
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Appendix B 



1 program scheduler(output); (* simple first come first serve scheduler *) 

2 type 

3 Tfr = record 

4 s, r, I, p: integer; (* start, rate, length, port *) 

5 nxt: TfrPtr; (* pointer to next transfer descriptor *) 

6 end; 

7 TfrPtr = ^Tfr; 

8 var 

9 h: TfrPtr; (* pointer to head of list of scheduled transfers *) 

10 function conflict(s, r, I, p: integer): boolean; (* true if transfer can be scheduled without collisions *) 

11 var 

1 2 c: boolean; (* true if transfers collide *) 

13 t: TfrPtr; 

14 m,n: integer; 

1 5 begin 

16 c := false; 

17 t:=h; 

18 while not c and (f'.nxt <> nil) do 

1 9 begin 

20 if p = t^.p then (* request is competing for a scheduled output port *) 

21 begin 

22 If not (((s+r*(l-1)) < t^.s) or (s > (t".s+t".r*(t".l-1)))) then (* transfers overlap *) 

23 begin 

24 m:=0;n:=0; 

25 if (s>t^.s)then 

26 n:=abs(s-t^.s)dlvt^.r 

27 else 

28 m := absts-t'^.s) divr; 

29 while ((m<l) and (n<t''J)) and (s+m*r <> t'^.s+nn"' j) do 

30 if s+m*r > t"'.s+n*t"'.r then 

31 n:=n+1 

32 else 

33 m:=m+1; 

34 c:=((m<l)and(n<t"J)); 

35 end; 

36 end; 

37 t:=t^.nxt; 

38 end; 

39 if not c then (* schedule transfer *) 

40 begin 

41 new(t); 

42 t^.s := s; t^.r := r; t^.l := I; t^.p := p; t'^.nxt := h; 

43 h:=t; 

44 end; 

45 return not c; 

46 end; 

47 function schedule(s, r, I. d, p: integer): integer; (* start, rate, length, deadline, port *) 

48 begin 

49 while s+(l-1)*r<=d do 

50 begin 

51 if conflict(s, r, I, p) then 

52 return s 

53 else 

54 s:=s+1; 
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55 end; 

56 return -1 ; 

57 end; 

58 begin 

59 new(h); h^.nxt := nil; {* dummy element *) 

60 end. 
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